home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Columbia Kermit
/
kermit.zip
/
newsgroups
/
misc.19990725-20000114
/
000225_news@columbia.edu _Sat Oct 23 14:57:27 1999.msg
< prev
next >
Wrap
Internet Message Format
|
2020-01-01
|
3KB
Return-Path: <news@columbia.edu>
Received: from newsmaster.cc.columbia.edu (newsmaster.cc.columbia.edu [128.59.59.30])
by watsun.cc.columbia.edu (8.8.5/8.8.5) with ESMTP id OAA16301
for <kermit.misc@watsun.cc.columbia.edu>; Sat, 23 Oct 1999 14:57:26 -0400 (EDT)
Received: (from news@localhost)
by newsmaster.cc.columbia.edu (8.8.5/8.8.5) id OAA19194
for kermit.misc@watsun.cc.columbia.edu; Sat, 23 Oct 1999 14:36:01 -0400 (EDT)
X-Authentication-Warning: newsmaster.cc.columbia.edu: news set sender to <news> using -f
From: fdc@watsun.cc.columbia.edu (Frank da Cruz)
Subject: Re: KOI80-u (was DEC-C version 6 question)
Date: 23 Oct 1999 18:36:00 GMT
Organization: Columbia University
Message-ID: <7usv6g$inb$1@newsmaster.cc.columbia.edu>
To: kermit.misc@columbia.edu
In article <Pine.BSF.4.05.9910231910520.15947-100000@ff.dsu.dp.ua>,
Dmitry Pryanishnikov <dmitry@digital.dp.ua> wrote:
: On Sat, 23 Oct 1999, Frank da Cruz wrote:
: > : 255 FF U042A CYRILLIC CAPITAL LETTER HARD SIGN
: > :
: > This is the first time I heard of KOI8-u. How does it differ from
: > KOI8-r? Can you tell me where I can find a code chart for KOI8-u?
:
: Is suspect the full RFC is too large to post it here... So I put here
: just heading:
:
Thanks. This is somewhat off-topic for the VMS newsgroup (sorry) but...
KOI8 is a becoming a very confusing term! The first KOI8 was GOST (USSR
Standard) 19768-1974. It was published outside the USSR as ECMA-113
Edition 1. In this version, the Cyrillic letters line up with the Roman
ones "by sound" (there is also a 7-bit version called Short KOI or KOI7,
in which Roman and Cyrillic exist side by side but only in uppercase, also
aligned by sound).
The original KOI8 was superseded by GOST 19768-1987 and published as
ECMA-113 Second Edition and as ISO 8859-5, the ISO Latin/Cyrillic Alphabet
(which, by the way, has the Ukranian and Belorussion characters). This one
is sometimes also called "New KOI8" and the previous one is called "Old
KOI8".
Now I realize that Old KOI8 is used widely in netnews and email instead of
ISO 8859-5, and I understand why: if you don't have a Cyrillic font or code
page, you can strip off the 8th bit and still read Old KOI8 text in Roman
letters "by sound". The part I did not realize before was that line- and
box-drawing characters had been added to Old KOI8 in columns 8 through B.
When did that happen? Where is it documented? (And how could graphic
characters be defined for columns 8 and 9, which are reserved for C1
control characters? -- This makes it incompatible with ISO 2022). And what
do we call this Old KOI8 with box drawing? "New Old KOI8?"
My other question is: RFC2319.TXT states that the four Ukrainian letters
(8 code points) are added at A4, A6, A7, AD, B4, B6, B7, and BD. What
characters in "New-Old KOI8" were replaced, if any?
If I discover the answers to these questions, maybe I can add support for
KOI8-u and "New Old KOI8-r" to C-Kermit 7.0. ("Old Old KOI8[-r]" is
already supported, as are ISO 8859-5, CP855, CP866, CP1251, Short KOI, and
now also the UCS).
- Frank